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Abstract. Text-To-Speech synthesizers (TTS) have raised the interest of researchers 
and teachers for their ability to enhance foreign/second language (L2) learning, 
particularly with regards to the development of pronunciation skills (Liakin, 
Cardoso, & Liakina, 2017). Despite some optimistic results, there are no studies 
that investigate TTS’s pedagogical potential to enhance L2 Phonological Awareness 
(PA), especially in foreign language contexts, where access to rich aural input is 
limited in terms of both quantity and quality. The present study examines TTS’s 
pedagogical potential as a tool to assist English L2 learners develop their PA, 
focusing on the morphophonological alternations that characterize regular past tense 
marking in English (past -ed). Results show that TTS contributed positively for the 
auditory perception and controlled (but not spontaneous) production of the targeted 


phenomenon. 
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1. Introduction 


Research has shown that the pedagogical use of TTS has the potential to contribute 
to learning in the acquisition of L2 vocabulary and pronunciation (e.g. Liakin et al., 
2017; Soler-Urzta, 2011). Recently, there have been studies that evaluate the voice 
quality of TTS systems, corroborating previous hypotheses that current synthesized 
voices are ready for use in L2 education: they are not only perceived as appropriate 
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(or ‘good enough’) from a sound perspective (Bione, Grimshaw, & Cardoso, 2017; 
Cardoso, Smith, & Garcia Fuentes, 2015), but they are also more likely to enhance 
opaque features of the target language (e.g. regular past tense marking in English is 
often detected with higher accuracy in synthesized speech; Bione et al., 2017; John 
& Cardoso, 2017). One area that has not received consideration by researchers is 
TTS’ ability to raise L2 learners’ PA. 


As such, this pilot study attempts to contribute to the literature on CALL-assisted 
PA. It focused on TTS’s ability to provide the type of input necessary to develop 
learners’ ability to become aware that past tense marking (past -ed) in English 
is characterized by three allomorphs: /t/ in the presence of a preceding voiceless 
segment (e.g. talk[t]), /Id/ when the targeted morpheme is preceded by homorganic 
/t/ or /d/ (e.g. add[Id]), and /d/ elsewhere (e.g. play[d]). To assess participants’ 
PA development, we included tasks that evaluated their pre-/post-test awareness 
of past -ed pronunciation, their auditory perception of the phenomenon, and 
their controlled/spontaneous oral production. The general question that guided 
this research was: can English as a Foreign Language (EFL) learners benefit 
from exposure to synthesized voices for developing their awareness to past -ed 
allomorphy? 


2. Method 


Nine Brazilian Portuguese (BP) native speakers participated in this study. They 
were EFL students living in Joao Pessoa (Brazil, with ages ranging from 25 to 35. 
They had all completed (or were attending) secondary school. 


The study consisted of two hour one-shot sessions with the participants. For the 
treatment, participants were asked to complete a set of learning activities using 
a popular NeoSpeech synthesized voice, Julie (via the NaturalReader software). 
They consisted of activities that encouraged them to listen and respond to question 
such as listen and decide whether the action took place in the present or past. 


For data collection, participants were asked to complete a series of tasks designed 
to evaluate their PA before and after the use of TTS during the treatment phase. 
To examine learners’ PA, participants completed two questionnaires (pre- and 
post-tests), in which they were asked to evaluate ten propositions about English 
pronunciation using an eight point Likert scale, according to Alves’s (2012) five 
levels of PA: L2 phonemes, L2 allophones, identification of non-distinctive sounds 
in L1 and distinctive sounds in L2, syllabic patterns, and rhymes. 
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Based on Cardoso et al. (2015) and Bione et al. (2017), participants also performed 
an auditory identification task, in which they heard 16 sentences, 12 of them in the 
past (four distractors), but without lexical elements that could identify their tense. 


To evaluate the controlled production of past -ed, participants read a list of 26 words 
aloud (14 were regular past verbs, the others were distractors). To evaluate less 
controlled (spontaneous henceforth) production, participants were asked to answer 
questions about the past holidays of fictional characters, using a set of regular past 
verbs. The key difference between these tasks was the amount of cognitive effort 
to complete them: while the first relied on the participants’ ability to produce ed- 
final words in isolation, the spontaneous task required them to focus on content, 
allowing us to obtain more natural speech (Gomes, 2015; Ortiz & Ivette, 2005; see 
Laboy, 1972 for a sociolinguistic perspective). 


3. Results 


3.1. Auditory perception 
Due to the limited scope of the study and the small number of participants, we 
report the results using descriptive statistics, in percentages. As Figure | indicates, 


there was a 7% improvement in past tense perception from pre-test to post-test. 


Figure 1. Auditory identification of past -ed: pre-test and post-test 
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These results suggest that access to synthesized L2 input may contribute to a better 
perception of sentential verbal tense, as well as the identification of regular past 
-ed in English. 
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3.2. Production 


Comparing pre-test to post-test results in controlled production, we observed an 
overall improvement of 12% in past -ed pronunciation, as shown in Figure 2. 


Figure 2. Controlled production of past -ed: pre-test and post-test 
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Interestingly, data from spontaneous production yielded a different pattern, 
showing a reduction in past -ed marking on the post-test, as illustrated in 
Figure 3. 


Figure 3. Spontaneous production of past -ed: pre-test and post-test 


100% 
80% 
9 
60% 47% 42% 
40% 


20% 


0% 
Pre-test Post-test 


In sum, our findings for oral production yielded mixed results regarding the 
pedagogical effects of TTS: While an improvement in past -ed pronunciation was 
observed in controlled production, the reverse was found in spontaneous tasks. 
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3.3. PA test 


Figure 4 shows an improvement in past -ed PA in three of the five levels of 
phonological awareness considered: L2 phonemes (from 50% to 88.89%), L2 
allophones (from 33.33% to 44.44 %), and identification of non-distinctive sounds 
in L1 (allophones) and distinctive in L2 (from 44.44% to 55.56%). Regarding the 
other two levels (syllabic patterns and rhymes), PA decreased from 61.11% to 
27.78% and from 72.22% to 50.00%, respectively. 


Figure 4. PA tests: pre-test and post-test 
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Overall, considering the five levels of PA adopted, the participants’ phonological 
awareness only increased by a mere 1.11% on the post-test (from 52.22% to 53.33%). 


4. Discussions and conclusions 


This study aimed to answer whether EFL learners could benefit from exposure 
to synthesized voices for developing their awareness to past -ed allomorphy. Our 
findings suggest that the pedagogical use of TTS can be effective in at least two 
of the measures encompassed by our definition of PA: auditory -ed perception and 
its controlled production. This can be explained by the fact that these two tasks 
required a focus on language (e.g. in comparison with spontaneous production). 
These findings suggest that, considering these two measures, TTS constitutes 
a good source of aural input (as reported Cardoso et al., 2015 and Bione et al., 
2017), and that it may be used to enhance the opaque alternations found in past 
-ed marking in English (as hypothesized, based on Bione et al., 2017, and John & 
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Cardoso, 2017). We acknowledge some of the limitations of our study, particularly 
the limited number of participants and lack of statistical rigour in our analysis. In 
future research, we would like to mitigate these limitations and verify if, given a 
larger sample, there will be a more significant improvement at post-test. 
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